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ABSTRACT 

Mixed-lineage leukemia (MLL) maintains the expres- 
sion of cellular memory genes during development, 
while leukemic MLL fusion proteins aberrantly 
maintain expression of hematopoietic stem cell 
program genes such as HOXA9 to cause leukemia. 
However, the molecular mechanism of gene activa- 
tion is unclear. Here we show that only two func- 
tional modules are necessary and sufficient for 
target recognition: those that bind to non- 
methylated CpGs and di-/tri-methylated histone H3 
lysine 36 (H3K36me2/3). An artificial protein 
composed of the two targeting modules and an 
interaction domain for AF4-family coactivators can 
functionally substitute for MLL fusion proteins. 
Because H3K36me2/3 markers are indicative of 
active transcription, MLL fusion proteins target 
previously active CpG-rich genes and activate tran- 
scription by recruiting coactivators thereto. Our 
results indicate that such chromatin context- 
dependent gene activation is the fundamental 
mechanism by which MLL fusion proteins main- 
tain the expression of the cellular memory/ 
hematopoietic stem cell program genes. 

INTRODUCTION 

The MLL gene encodes an epigenetic regulator that main- 
tains HOX gene expression during embryogenesis (1). 
HOX genes are so-called cellular memory genes because 
their expression is maintained throughout the develop- 
ment. In the hematopoietic lineage, the MLL protein 



(also known as HRX, MLL1 and KMT2A) activates the 
transcription of posterior HOXA genes (e.g. HOXA7-A10) 
(2,3). Posterior HOXA genes are hematopoietic stem cell 
(HSC) program genes (4) that promote the self-renewal of 
HSCs/immature progenitors (5). In normal hematopoiesis, 
their expression is maintained by MLL in the HSC/ 
immature progenitor compartments, which diminishes as 
cells differentiate. Chromosomal translocation generates 
MLL fusion genes, whose products constitutively 
activate the posterior HOXA genes, which results in 
aberrant self-renewal of hematopoietic progenitors, 
leading to leukemia (6). However, the precise molecular 
mechanism by which MLL and MLL fusion proteins 
activate their target genes remains unclear. MLL fusion 
proteins exert their oncogenic functions as a complex with 
the lens epithelium-derived growth factor (LEDGF) (also 
known as PSIP1) (7). Disruption of Psipl in mice causes 
homeotic skeletal transformation, a characteristic pheno- 
type caused by dysregulation of Hox gene expression (8). 
LEDGF also facilitates the specific integration of the HIV 
genome into transcriptionally active regions, presumably 
by tethering the HIV genome/integrase complex with tran- 
scriptionally active chromatin (9,10). In leukemia, MLL 
frequently fuses with the ALLl-fused gene from chromo- 
some 4 (AF4)- and eleven-nineteen leukemia (£7VX)-family 
genes, whose protein products are the components of 
a transcriptional coactivator, termed AEP (the AF4 
family/ENL family/PTEFb complex) (11-14). MLL 
fusion proteins constitutively recruit AEP components 
to activate transcription through direct interaction (e.g. 
MLL-ENL) or by an as-yet-uncharacterized indirect 
mechanism (e.g. MLL-AF6) (14). These previous studies 
postulate that MLL fusion proteins recognize their target 
genes through the MLL portion and constitutively recruit 
the AEP coactivator through the fusion partner portion. 
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Here, we report that MLL fusion proteins recognize a 
specific chromatin context to activate cellular memory 
genes. Our structure/function analyses demonstrate that 
MLL fusion proteins recognize their target genes 
through two functional domains: the PWWP domain of 
LEDGF and the CXXC domain of MLL that specifically 
bind to H3K36me2/3 and non-methylated CpGs, respect- 
ively. Because H3K36me2 and H3K36me3 are generally 
linked to gene activation (15,16), a previously transcribed 
gene containing non-methylated CpGs in its promoter is 
subjected to transcriptional activation by MLL fusion 
proteins. These studies provide a novel chromatin 
context-dependent gene activation mechanism by which 
MLL fusion proteins maintain cellular memory. 

MATERIALS AND METHODS 

Cell culture 

The human leukemia cell line ML-2 was cultured in RPMI 
1640 medium supplemented with 10% fetal calf serum and 
non-essential amino acids. 293T, plat-E and HeLa cells 
were cultured in Dulbecco's modified Eagle's medium sup- 
plemented with 10% fetal calf serum and non-essential 
amino acids. 

Vector construction 

The pMSCV-neo MLL-ENL, PME (7) and Bcl2- 
ires-E2A-HLF (17) vectors are described elsewhere. 
Various MLL fusion constructs were generated by restric- 
tion enzyme digestion or polymerase chain reaction 
(PCR)-based mutagenesis. The cDNAs were cloned into 
the pMSCV neo vector (for virus production) (Clontech, 
Mountain View, CA) or the pCMV5 vector (for transient 
expression). The expression vectors for FLAG-tagged 
GAL4 fusion proteins were constructed by PCR using 
pM (Clontech) as template and cloned into the pCMV5 
vector. Various LEDGF constructs were generated by 
PCR-based mutagenesis and cloned into the pcDNA4 
HisMax vector (Invitrogen, Carlsbad, CA). 

Western blotting 

Western blotting was performed as described elsewhere 
(14). The antibodies used in this study have been listed 
in Supplementary Table S2. 

Reverse transcription-PCR 

RNAs were prepared using the RNeasy kit (Qiagen, 
Valencia, CA) and reverse-transcribed with the 
Superscript III first strand cDNA synthesis kit with 
oligo(dT) primers (Invitrogen). Expression of PCE was 
confirmed by regular PCR using the primer pair 5 -AGA 
ATTCGATATCGGAAACATGACTCGCGATTTCAA 
ACC-3 / /5 / -TCACGTGTTCGCGATGCGACGGGCTTT 
CGTGGAGGAG-3'. Quantitative PCR (qPCR) for other 
genes was performed as described previously (14) using 
TaqMan probes [Gapdh (Mm99999915_gl), Hoxa7 
(Mm00657963_ml), Hoxa9 (Mm00439364_ml), HoxalO 
(Mm00433966_ml), Cbx5 (Mm00483092_ml), Myb 
(Mm00501741_ml) and Hmgb3 (Mm01377544_gH) 



(Applied Biosystems)]. The expression levels, normalized 
to Gapdh, were estimated using a standard curve and the 
relative quantification method, as described in ABI User 
Bulletin #2. 

Virus production 

The ecotropic retrovirus was produced using plat-E 
packaging cells (18). Supernatant medium containing the 
virus was harvested 24-48 h after transfection and used for 
viral transduction. 

Myeloid progenitor transformation assay 

The myeloid progenitor transformation assay was per- 
formed using cells harvested from the femurs and tibiae 
of C57BL/6 mice. C-kit-positive cells were enriched using 
magnetic beads conjugated with anti-c-kit antibody 
(Miltenyi Biotech, Bergisch Gladbach, Germany), 
transduced with recombinant retrovirus by spinoculation 
(19) and plated in methylcellulose medium (Iscove's 
modified Dulbecco's medium, 20% fetal bovine serum, 
1.6% methylcellulose, 100 uM (3-mercaptoethanol) con- 
taining stem cell factor, interleukin-3 and granulocyte 
macrophage colony-stimulating factor (lOng/ml). The 
colony-forming units per 10 4 plated cells at the third and 
fourth rounds were quantified after 5-7 days of culture. 

Subcellular fractionation and nucleosome 
co-immunoprecipitation analysis 

Subcellular fractions of 293T cells were obtained by CSK 
buffer extraction (20) and MNase treatment. 293T cells 
cultured in a 10-cm dish were resuspended in 1ml of 
CSK buffer [100 mM NaCl, 10 mM PIPES (pH 6.8), 
3mM MgCl 2 , ImM EGTA (pH 7.6), 0.3 M sucrose, 
0.5% Triton X-100, 5mM sodium butyrate, 0.5 mM 
DTT, EDTA-free protease inhibitor cocktail (Roche, 
Basel, Switzerland) and 2mM vanadyl ribonucleoside 
complexes (Sigma, St. Louis, MO)], incubated on ice for 
5min and then centrifuged (400g, 4°C, 4min). The super- 
natant (the soluble fraction) was transferred to a new tube, 
and the pellet was resuspended in 1 ml of MNase buffer 
[50mM Tris-HCl (pH 7.5), 4mM MgCl 2 , ImM CaCl 2 , 
0.3 M sucrose, 5mM sodium butyrate, 0.5 mM DTT 
and protease inhibitor cocktail]. One unit of MNase 
(Sigma) was added to the suspension, and the mixture 
was incubated at 37°C for 10-1 2 min to obtain 
mononucleosomes. The MNase reaction was stopped by 
adding EDTA (pH 8.0) at a final concentration of 20 mM. 
The reaction mixture was centrifuged (13 000rpm, 4°C, 
5 min) to separate the supernatant (the nucleosome 
fraction) and the pellet. The pellet was resuspended in 
elution buffer (1% sodium dodecyl sulfate, 50 mM 
NaHC0 3 ). The nucleosome fractions were subjected 
to immunoprecipitation (IP) using specific antibodies; 
washed five times with MNase buffer with 20 mM 
EDTA and analyzed by western blotting, Oriole staining 
(Bio-Rad, Hercules, CA), SYBR Green staining (Lonza, 
Basel, Switzerland) and mass spectrometry. Optionally, 
the precipitates were washed with MNase buffer three 
times, treated with DNase I (Qiagen) for 10 min at 37°C 
and washed five times with MNase buffer with 20 mM 
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EDTA. The precipitates were analyzed by western blotting 
and SYBR Green staining. 

Fractionation-assisted native chromatin 
immunoprecipitation 

293T cells cultured in a 10-cm dish or ML-2 cells (2 x 10 7 ) 
were suspended in 1 ml of CSK buffer to remove chroma- 
tin-unbound materials. The pellet was then resuspended in 
1 ml of MNase buffer and mixed with 0.2-1 U of MNase 
for 5-10 min at 37°C to obtain chromatin with DNA 
lengths of 150-3000 base pairs (bp). The MNase 
reaction was stopped by adding EDTA (pH 8.0) at a 
final concentration of 20 mM, and then the mixture 
was placed on ice. Lysis buffer (1ml) [10% glycerol, 
20 mM sodium phosphate (pH 7.0), 250 mM NaCl, 
30 mM sodium pyrophosphate, 0.1% Nonidet P-40, 
5mM EDTA, lOmM NaF and protease inhibitor 
cocktail] (21) was added to solubilize the proteins, and 
the mixture was spun down to remove insoluble 
material. The supernatant was then subjected to IP. 
Approximately 1 jig of the primary antibodies listed in 
Supplementary Table S2 was added to 400 ul of chromatin 
suspension, and the mixture was incubated for 4-6 h at 
4°C. Ten microliters of protein-G magnetic beads 
(Invitrogen) was added to each sample, and the mixture 
was incubated for 2h with rotation. The beads were 
washed five times with 500 pi of a 1:1 mixture of MNase 
buffer containing 20 mM EDTA and lysis buffer. For 
histone modifications, incubation and washing were per- 
formed with higher NaCl concentrations (plus 400 mM) to 
strip chroma tin-binding proteins. DNAs were harvested 
from the precipitates by dissolving in elution buffer 
followed by phenol/chloroform/iso-amyl alcohol extrac- 
tion and ethanol precipitation. Deep sequencing of the 
precipitated DNAs was performed using the Illumina 
genome analyzer IIx at the Joint Usage/Research Center 
(RIRBM), Hiroshima University. qPCR was performed 
on the precipitated DNAs using the custom-made primer 
sets described in Supplementary Table S3. The value 
relative to the input was determined using a standard 
curve and the relative quantification method as described 
in the ABI User Bulletin #2. Optionally, co-precipitated 
proteins were harvested by dissolving the immunopre- 
cipitates in elution buffer, mixed with an equal volume 
of 2x sodium dodecyl sulfate-polyacrylamide gel electro- 
phoresis sample buffer and subjected to western blotting. 

Liquid chromatography-tandem mass 
spectrometry analysis 

The method for trypsin digestion of protein has been 
described previously (22). Tandem mass spectrometry 
analysis was performed using an LTQ Orbitrap ELITE 
ETD mass spectrometer (Thermo Fisher Scientific). The 
methods used for liquid chromatography-tandem mass 
spectrometry (LC-MS/MS) were slightly modified from 
those described previously (23). The mass spectrometer 
was operated in data-dependent acquisition mode in 
which MS acquisition with a mass range of m/z 400- 
1000 was automatically switched to MS/MS acquisition 
under the control of Xcalibur software. The top four 



precursor ions in the MS scan were selected by Orbitrap, 
with resolution R = 240 000, and those in the subsequent 
MS/MS scans, with an ion trap in automated gain control 
(AGC) mode, where AGC values were 1 x 10 6 and 
1.00 x 10 4 for full MS and MS/MS, respectively. For frag- 
mentation, electron transfer dissociation was used. 

In vivo leukemogenesis assay 

C-kit-positive cells (2 x 10 5 ) prepared from mouse femurs 
and tibiae were transduced with retrovirus by 
spinoculation and intravenously transplanted into 
sublethally irradiated (two doses of 500 rad in 2 days) 
C57BL/6 mice. Moribund mice were sacrificed, and the 
spleen cells were either subjected to cytospin preparation 
followed by May-Grunwald/Giemsa staining or tempor- 
arily cultured in methylcellulose medium in the presence 
of G418 (lmg/ml) to remove untransformed cells, and 
then subjected to secondary transplantation or reverse 
transcription-PCR (RT-PCR) analysis. 

CpG island recovery assay 

CpG island recovery assays for non-methylated CpGs 
(CIRA) and methylated CpGs (MIRA) were performed 
using the Unmethyl Collector kit and Methyl Collector 
Ultra kit, respectively (Active Motif, Carlsbad, CA). 
Deep sequencing after CIRA and MIRA was carried out 
at the Joint Usage/Research Center (RIRBM), Hiroshima 
University. 

Transactivation assay 

Transactivation assays using the pFR-luc reporter 
(Clontech) were performed as described elsewhere (14). 
Relative luciferase activities were normalized to Renilla 
luciferase activity and expressed in terms of the average 
values and standard deviations (SDs) of triplicate deter- 
minations relative to the GAL4 DNA binding domain 
controls. 



RESULTS 

Murine leukemia models define the major functional 
modules required for leukemic transformation 
by MLL-ENL 

MLL fusion proteins form a trimeric complex with menin 
and LEDGF through the MLL portion (7). Because MLL 
fusion proteins associate with LEDGF through menin as a 
mediator, an MLL-ENL mutant (MEdNter) lacking the 
high-affinity menin-binding motif failed to transform 
hematopoietic progenitors in a myeloid progenitor trans- 
formation assay (Figure 1A), in which successful trans- 
formation is represented by vigorous colony-forming 
activity in the third and fourth rounds of plating, and 
elevated expression of Hoxa9 in first-round colonies 
(Figure IB). An artificial MLL-ENL fusion protein 
(PME), in which high-affinity menin-binding motif is 
replaced by the PWWP domain of LEDGF, transformed 
myeloid progenitors despite its inability to form the 
trimeric complex (7). Therefore, the PWWP domain is 
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Figure 1. Major functional domains required for leukemogenesis by MLL-ENL. (A) Experimental scheme for the myeloid progenitor transformation 
assay. (B) Transforming ability of various MLL-ENL (ME) mutants. The schematic structures of LEDGF, MENIN, ENL and various ME mutants are 
shown, with the key functional modules indicated (left). P: the PWWP domain. C: the CXXC domain. E: the ENL portion included in MLL-ENL. The 
numbers of colony-forming units (CFUs) at the third and fourth rounds of replating are shown with error bars (SD of >3 independent experiments) 
(middle). Hoxa9 expression is presented relative to the value of ME (arbitrarily set at 100%) with error bars (SD of triplicate PCRs) (right). (C) Protein 
expression of the PWWP-MLL-ENL (PME) and PCE mutants in the packaging cells. The PME and PCE proteins were visualized using the anti-ENL 
antibody. Molecular standards are shown on the left. (D) Experimental scheme for the in vivo leukemogenesis assay. (E) Survival of the transplanted 
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the single functional module within menin and LEDGF 
that is required for MLL-ENL-dependent transformation. 

To identify the structural requirements of MLL fusion 
proteins besides the PWWP domain, we examined the 
transforming ability of a series of PME mutants with 
various internal deletions in the MLL gene. A mutant 
lacking the CXXC domain (PMEdC) failed to transform 
myeloid progenitors (Figure IB and C). Conversely, a 
mutant (PCE) consisting only of the PWWP domain, the 
CXXC domain and the ENL portion activated Hoxa9 
expression and immortalized myeloid progenitors. The 
PCE mutant induced leukemia in recipient mice within 
3-4 months in the in vivo leukemogenesis assay 
(Figure 1D-F). The PCE-induced leukemic blasts had 
monocytic features (Figure 1G) and elevated expression 
of posterior Hoxa genes and leukemia stem cell signature 
genes including Myb, Hmgb3 and Cbx5 (Supplementary 
Figure SI), which are major characteristics of MLL- 
associated myeloid leukemias (24). The leukemic blasts 
recapitulated the same disease in secondary recipients 
with a much shorter latency period (Figure IE). These 
results clearly show that PCE is as functional as MLL- 
ENL for leukemic transformation. Thus, the PWWP 
domain, the CXXC domain and the ENL portion 
are the major structural elements required for MLL- 
associated leukemogenesis. 

Binding to a specific nucleosome through the PWWP 
domain is essential for MLL fusion-dependent 
gene activation 

To further analyze the function of the PWWP domain, we 
next tested a series of artificial PCE derivatives with 
various alterations in the PWWP domain for the trans- 
forming ability of myeloid progenitors (Figure 2A and 
B). A mutant (P'CE) with the minimum structure of the 
PWWP domain exhibited this transformation ability, 
whereas an alanine substitution of the evolutionarily 
conserved tryptophan (W21) resulted in a loss of the trans- 
formation activity. The PWWP domain of LEDGF has 
been implicated in chromatin binding (25,26). To examine 
its chromatin-binding capacity, we transiently expressed 
LEDGF and its mutant (dPWWP) lacking the PWWP 
domain in 293T cells (Figure 2C), and we divided the 
cellular components into three subfractions (Figure 2D 
and Supplementary Figure S2A). In this procedure, 

(i) chromatin-unbound soluble materials were extracted 
by cytoskeletal (CSK) buffer (the soluble fraction); 

(ii) chromatin and its associated factors were eluted by 
DNA fragmentation with micrococcal nuclease (MNase) 
(the nucleosome fraction); and (iii) insoluble materials 
after MNase treatment were solubilized by denaturing 
detergent (the nuclear matrix fraction). Nearly all the 
nucleosomes in the nucleosome fraction were 
mononucleosomes based on their DNA length (~ 150 bp) 



(Figure 2E, top panels). LEDGF substantially localized to 
the nucleosome fraction (Figure 2E, bottom panels) and 
was able to pull down mononucleosomes (Figure 2F), 
whereas dPWWP mainly localized to the soluble 
fraction (Figure 2E) and failed to pull down nucleosomes 
(Figure 2F). The PWWP domain (P') by itself localized in 
the nucleosome fraction (Figure 2G and H) and pulled 
down endogenous nucleosomes (Figure 21). Previously, 
one group showed that LEDGF had specifically 
associated with the histone H3 peptide containing 
either H3K36me2 or H3K36me3 (25), while another 
group showed that the recombinant PWWP domain of 
LEDGF directly binds to H3K36me3 but not to 
H3K36me2 in vitro (26), suggesting that the peptide pull- 
down assay tends to overrepresent H3K36me3-PWWP 
interaction, depending on the assay conditions. Our 
nucleosome co-immunoprecipitation (co-IP) assay, which 
can detect protein-nucleosome interaction under physio- 
logical conditions, showed that the nucleosomes co- 
precipitated with V contained high amounts of not only 
H3K36me3 but also H3K36me2 (Figure 21), indicating 
that the PWWP domain of LEDGF is capable of 
binding to either H3K36me2 or H3K36me3 in vivo. 
In accordance with these results, an artificial protein 
(P'C) composed of the PWWP domain and the CXXC 
domain was substantially localized in the nucleosome 
fraction (Figures 2G and H) and co-precipitated with 
mononucleosomes and DNAs (Supplementary Figure 
S2B and C). The W21A substitution mutants of P' 
and FC exclusively localized to the soluble fraction 
(Figures 2G and H) and therefore were not used for the 
nucleosome co-IP analysis (Supplementary Figures S2D 
and E). These results indicate that the PWWP domain, 
and not the CXXC domain, is mainly responsible for 
stable association with chromatin. The purified PC- 
nucleosome complex contained no other proteins but 
histones at the stoichiometric level (Supplementary 
Figure S2B). Complete removal of the DNAs by 
DNase I treatment did not abolish the FC-nucleosome 
interaction (Figure 2J) or P'-nucleosome interaction 
(Supplementary Figure S2F). Thus, the PWWP domain 
directly associates with histones. Taken together, these 
findings show that the binding ability of the PWWP 
domain to specific histones is required for MLL fusion- 
dependent gene activation. 

The PWWP domain and the CXXC domain target 
transcriptionally active promoters containing either 
H3K36me2 or H3K36me3 

We next examined the functional similarities among 
various PWWP domains. HRP2 is a close homolog 
of LEDGF and has a highly homologous PWWP 
domain (Figure 3 A). The bromodomain and PHD 
finger-containing 1 (BRPF1) protein is an associated 



Figure 1. Continued 

animals in the in vivo leukemogenesis assay. Light gray, vector; black, first PCE transplantation; dark gray, second PCE transplantation, n, number 
of animals analyzed. (F) Expression of PCE in the PCE-induced leukemia cells (PCE-LCs). Leukemic cells derived from moribund mice were cultured 
ex vivo in the presence of G418 and analyzed by RT-PCR. The PCE plasmid, PCE-immortalized cells (PCE-ICs) and Bcl2/E2A-HLF-immortalized 
cells (BiEH-ICs) were included for comparison, nsp, non-specific band. (G) Morphology of PCE-LCs in the spleen. The splenic cells harvested from a 
PCE-leukemic mouse were stained by the May-Grunwald/Giemsa staining method. Normal spleen is included for comparison. Scale bar, 50|im. 
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Figure 2. Nucleosome binding by the PWWP domain is essential for MLL-ENL-dependent transformation. (A) Transforming ability of PCE 
mutants containing various mutations within the PWWP domain. P ; : the minimum structure of the PWWP domain. The schematic structures of 
LEDGF and various PCE mutants are shown (left). The CFUs at the third and fourth rounds of replating are shown with error bars (SD of >3 
independent experiments) (middle). Hoxa9 expression in the first-round colonies is expressed relative to that of PCE (arbitrarily set at 100%) with 
error bars (SD of triplicate PCRs) (right). (B) Protein expression of the PCE mutants in the packaging cells. The PCE mutant proteins were 
visualized using the anti-ENL antibody. (C) The schematic structures of LEDGF and its mutant lacking the PWWP domain (dPWWP). An Xpress 
tag (gray flag) is fused to the N-terminal end of the LEDGF proteins. Their ability to associate with nucleosomes is summarized. (D) Experimental 
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factor of the MOZ histone acetyltransferase (27) and has a 
less conserved PWWP domain, which has been recently 
shown to directly bind to histone H3 peptide containing 
H3K36me2 or H3K36me3 in vitro (28). To investigate the 
functional differences and similarities of the PWWP 
domains of LEDGF, HRP2 and BRPF1, we tested 
whether these PWWP domains are interchangeable in 
the myeloid progenitor transformation assay. The P'CE 
mutants whose PWWP domain is replaced by those of 
HRP2 and LEDGF successfully activated Hoxa9 expres- 
sion and immortalized myeloid progenitors (Figure 3A 
and B). Nucleosome co-IP assay with the P'C mutants 
containing the PWWP domains of LEDGF and BRPF1 
specifically enriched nucleosomes containing H3K36me2 
more efficiently than those containing H3K36me3 
(Figure 3C and D). Consistent with these results, mass 
spectrometry analysis indicated that two-thirds of 
FC-bound nucleosomes contained the H3K36me2 
marker, while the H3K36me3 marker was minor 
compared with P'-bound nucleosomes (Supplementary 
Figure S3). In addition, chromatin modifications charac- 
teristic to active promoters such as di-methylated histone 
H3 lysine 4 (H3K4me2) (Figures 21 and 3D), acetylated 
histone H3 lysine 9 (H3K9ac) and acetylated histone H3 
lysine 27 (H3K27ac) (Supplementary Figure S3) were 
enriched in P'C-bound nucleosomes, suggesting that the 
CXXC domain confers affinity toward active promoters. 
To determine the targeting ability of these artificial 
proteins to known MLL target genes in vivo, we per- 
formed chromatin immunoprecipitation (ChIP) followed 
by qPCR on 293T cells transiently expressing those PC 
mutants (Figure 3C and E). To this end, we devised a 
highly sensitive native ChIP protocol by altering the afore- 
mentioned nucleosome IP protocol. In this ChIP method, 
we resuspended the cells in CSK buffer to remove the 
chromatin-unbound materials and treated the cells with 
MNase to digest the DNAs to achieve the average 
length of 500-1000 bp. The highly solubilized chromatin 
fraction was extracted with lysis buffer and then subjected 
to IP. We designated this protocol fractionation-assisted 
native ChIP, but hereafter it is referred to as ChIP. 
Ecotropic viral integration site 1 (EVU), paired-like 
homeodomain transcription factor 2 (PITX2), HOXA7 
and HOXA9 were previously reported as potential MLL 



target genes (14,28-30). Although the chromatin environ- 
ment in 293T cells might be different from that in 
leukemia cells, the P'C mutants, but not P'(W21A)C, 
were localized at the transcription start sites (TSSs) 
of PITX2, HOXA7 and HOXA9, but not EVU 
(Figure 3E). These genes showed enrichment of 
H3K4me2 near the TSS, consistent with the observation 
that P'C-bound nucleosomes contain high levels of 
H3K4me2 modification (Figure 3D-F). H3K36me2 was 
present in all of the P'C mutant-occupied loci, while 
H3K36me3 was only found at the HOXA9 locus 
(Figure 3F), supporting the notion that the PWWP 
domain binds either H3K36me2 or H3K36me3 in vivo. 
The presence of RNA polymerase II (RNAPII) indicated 
that PITX2, HOXA7 and HOXA9, but not EVU, were 
transcriptionally active in 293T cells (Figure 3F). Thus, 
the PWWP domain and the CXXC domain collabora- 
tively target the promoter-proximal regions of transcrip- 
tionally active genes containing either H3K36me2 or 
H3K36me3. 

The chromatin context comprising H3K36me2/3 and 
non-methylated CpGs is targeted by MLL fusion proteins 

Next, we examined the structural requirements of the 
CXXC domain. A deletion mutant (P'CE) that lacks the 
sequences flanking the CXXC domain successfully trans- 
formed myeloid progenitors (Figure 4A and B). Therefore, 
the CXXC domain, but not its flanking sequences, is 
required for the proper targeting of MLL-ENL. The 
CXXC domain of MLL directly binds to non-methylated 
CpGs (31-34). Point mutations that have been proved to 
abolish the binding ability to non-methylated CpGs 
(C1155A and K1186A) (31) resulted in a loss of transform- 
ing activity (Figure 4A and B), while the Q1162A substitu- 
tion, which has no effect on non-methylated CpG-specific 
binding, did not compromise transformation. Mutations in 
the KFGG motif (K1178A and K1179A) that severely at- 
tenuate the binding ability to non-methylated CpGs (31) 
also abolished transforming activity. The artificial protein 
(P'C') composed of the minimum structures of the PWWP 
domain and the CXXC domain localized in the nucleosome 
fraction (Figure 4C and D) and associated with nucleo- 
somes (Figure 4E). The P'C mutants carrying CpG 
binding-deficient mutations retained the nucleosome 



Figure 2. Continued 

scheme of subcellular fractionation. (E) Distribution of DNAs/RNAs (top) and LEDGF proteins (bottom) in the three subfractions. 293T cells 
transiently expressing the indicated proteins were subfractionated into the soluble fraction (SOL), the nucleosome fraction (NUC) and the nuclear 
matrix fraction (NM). The LEDGF proteins were visualized using anti-Xpress antibody. (F) Nucleosomes associated with LEDGF and dPWWP. 
Nucleosome co-IP assay was performed with the nucleosome fractions of 293T cells transiently expressing the LEDGF proteins. The LEDGF mutant 
proteins and histone H3 proteins were visualized using anti-Xpress and histone H3 antibodies, respectively. (G) The schematic structures of the P' 
and the P'C mutants. An Xpress tag (gray flag) is fused to the N-terminal end of P' mutants. A FLAG tag (black flag) is fused to the C-terminal end 
of the PC mutants. The localization to the nucleosome fraction of each mutant and its ability to bind nucleosomes are summarized. N.A., not 
applicable. (H) Subcellular distribution of the P' and P'C mutants. Experiments were performed as in (E). The P' mutants were visualized by using 
the anti-Xpress antibody. The P'C mutants were visualized by using the anti-FLAG antibody. (I) Nucleosomes associated with P'. P' and its 
associating nucleosomes were co-purified by co-IP from the NUC fraction of 293T cells transiently expressing P' as in (F). Nucleosomes were 
visualized by antibodies specific for mono-methylated histone H3 lysine 36 (H3K36mel), H3K36me2 and H3K36me3 or by anti-histone H2B and 
-H3 antibodies. Di-methylated histone H3 lysine 4 (H3K4me2), tri-methylated histone H3 lysine 27 (H3K27me3) and acetylated histone H3 lysine 27 
(H3K27ac) were also analyzed for comparison. (J) The role of DNAs in P'C-nucleosome interaction. P'C and its associating nucleosomes were co- 
purified by co-IP using the anti-FLAG antibody from the NUC fraction of 293T cells expressing P'C. The precipitates were treated with or without 
DNase I (300 kU/ml) for lOmin and washed to remove unbound material. DNAs co-precipitated with P'C were visualized by SYBR Green staining. 
Nucleosomes were visualized by antibodies specific for H3K36me2 and H3K36me3 modification or by the anti-histone-H3 antibody. 
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Figure 3. The PWWP domain and the CXXC domain target active promoters containing H3K36me2/3 in vivo. (A) Transforming ability of P'CE 
mutants containing three different PWWP domains. The schematic structures of LEDGF, HRP2, BRPF1 and various P'CE mutants are shown (left). 
Sequence alignment of the three PWWP domains is shown. Asterisk, conserved residue. The tryptophan residue mutated in Figure 2 A is indicated by 
#. The CFUs at the third and fourth rounds of replating are shown with error bars (SD of >3 independent experiments) (middle). Hoxa9 expression 
in the first-round colonies is expressed relative to that of P'CE (arbitrarily set at 100%) with error bars (SD of triplicate PCRs) (right). (B) Protein 
expression of the P'CE mutants carrying non-LEDGF PWWP domains in the packaging cells. The P'CE mutant proteins were visualized using the 
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binding activity because they had an intact PWWP domain 
(Figure 4C-E). To investigate the CpG methylation status 
of the MLL target genes, we performed CIRA using the 
recombinant CXXC domain protein followed by qPCR, 
which showed that PITX2, HOXA7 and HOXA9 loci con- 
tained ample amounts of non-methylated CpGs near the 
TSSs in 293T cells (Figure 4F). ChlP-qPCR analysis of 
29 3T cells transiently expressing these PC mutants 
showed that only the wild-type control, not the CpG 
binding-deficient mutants, was able to associate with the 
promoters of MLL target genes (Figure 4G). Thus, 
binding to non-methylated CpGs through the CXXC 
domain is essential for the proper targeting of MLL 
fusion proteins. 

There are dozens of CXXC domain-containing proteins 
in mammals. To examine the functional similarity of dif- 
ferent CXXC domains, we generated several domain swap 
mutants of P'C'E whose CXXC domain is replaced by 
other CXXC domains (Figure 5A and B). The CXXC 
domains of MLL2 (also known as HRX2, MLL4 and 
KMT2B) (35), DNA (cytosine-5)-methyltransferase 1 
(DNMT1) (36) and F-box and leucine-rich repeat 
protein 11 (FBXL11; also known as KDM2A) (37) were 
previously shown to bind to non-methylated CpGs. P'C'E 
mutants carrying these three non-MLL CXXC domains 
were competent for leukemic transformation (Figure 5A 
and B). Furthermore, the PC mutant with the CXXC 
domain of FBXL11 (Figure 5C and Supplementary 
Figure S4) was able to associate with MLL target genes 
in 293T cells (Figure 5D). These results indicate that the 
three different CXXC domains are functionally equivalent 
in terms of MLL-dependent gene activation, confirming 
the notion that the binding ability to non-methylated 
CpGs, a common function of those three CXXC 
domains, is required for targeting of MLL fusion proteins. 

Taken together, the minimum functional modules 
required for the targeting of MLL fusion protein are the 
PWWP domain and the CXXC domain, which bind to 
nucleosomes with H3K36me2/3 and non-methylated 
CpGs, respectively. Therefore, the MLL fusion complex 
targets a chromatin context in which H3K36me2/3 and 
non-methylated CpGs coexist. 

The MLL-AF6 complex localizes at transcriptionally 
active promoters containing H3K36me2/3 and 
non-methylated CpGs in vivo 

To determine the genomic landscape of the MLL target 
genes under physiological conditions, we performed a 
genome-wide ChIP analysis of cells that express an 
MLL fusion protein endogenously. ML-2 cells express 



MLL-AF6, but not wild-type MLL (38,39); therefore, 
the ChIP signal obtained from ML-2 cells using the anti- 
MLL antibody can be ascribed to the presence of MLL- 
AF6. Our ChIP analysis followed by deep sequencing 
(ChlP-seq) using the anti-MLL antibody identified 154 
TSSs of 131 MLL-AF6 target genes (Supplementary 
Table SI) by virtue of enriched ChIP signals of MLL- 
AF6 near the TSSs. These regions included previously 
identified MLL target genes such as HOXA9, HOXA10, 
cyclin- dependent kinase inhibitor IB (CDKN1B), 
CDKN2C, transcription factor 4 (TCF4) and runt-related 
transcription factor 2 (RUNX2) (14,40), and some newly 
identified genes such as special AT-rich sequence-binding 
protein- 1 (SATB1), ribosomal protein L31 (RPL31) 
and RPL10A (Supplementary Figure S5A and 
Supplementary Table SI). Gene ontology analysis of 
these MLL-AF6 target genes indicates that MLL-AF6 
mainly associates with genes involved in gene expression 
and translation (Supplementary Figure S5B). The average 
distribution of MLL-AF6 at the MLL target loci indicates 
that MLL-AF6 localizes near the TSSs (Figure 6A). MLL 
target loci contain a relatively large amount of CpG 
sequence near the TSSs. CIRA and MIRA followed by 
deep sequencing (CIRA-seq and MIRA-seq respectively) 
showed that MLL target loci are rich with non-methylated 
CpGs but are devoid of methylated CpGs. MLL-AF6- 
occupied loci also exhibited an enrichment of RNAPII 
and H3K36me3 (Figure 6A and Supplementary Figure 
S5C). Thus, endogenously expressed MLL-AF6 specific- 
ally localizes at CpG-rich promoters of transcriptionally 
active genes. 

ChlP-qPCR and CIRA-qPCR analyses on each gene 
confirmed that the MLL-AF6 complex localized at the 
major MLL target loci including HOXA7, CDKN2C, 
CDKN1B and HOXA9 but was completely absent at the 
gene loci of PITX2 and MEISf which are potentially 
regulated by MLL (14) but are transcriptionally inactive 
in this cell line (Figure 6B and C and Supplementary 
Figure S5D and E). MLL-AF6, menin and LEDGF 
co-localized near the TSSs of MLL target genes where 
non-methylated CpGs are abundant (Figure 6B and 
Supplementary Figure S5D). H3K36me2 was ubiquitously 
present and relatively enriched at the transcriptionally active 
loci. The MLL-AF6 complex localized at the promoter- 
proximal regions of HOXA7 and CDKN2C, which were 
H3K36me2-rich but H3K36me3-poor (Figure 6B and C). 
On the other hand, the MLL-AF6 complex was highly 
enriched near the TSS of HOXA9, where H3K36me2 was 
scarce but H3K36me3 was abundant (Figure 6B and C and 
Supplementary Figure S5D and E). These results confirm 



Figure 3. Continued 

anti-ENL antibody. (C) The schematic structures of various PC mutants. A FLAG tag (black flag) is fused to the C-terminal end of the P'C 
mutants. Their abilities to bind nucleosomes and target promoters are summarized. (D) Nucleosomes associated with the PC mutants. The P'C 
mutants and their associating nucleosomes were co-purified by co-IP using the anti-FLAG antibody from the NUC fraction of 293T cells transiently 
expressing the P'C mutants. Nucleosomes were visualized by antibodies specific for histone H3 or the indicated modifications. Nucleosomes co- 
purified with FLAG-tagged histone H4 were also analyzed for comparison. (E) Chromatin targeting ability of the PC mutants. Genomic localization 
of the PC mutants was analyzed by ChIP using the anti-FLAG antibody in 293T cells transiently expressing the PC mutants. The precipitated 
DNAs were analyzed by qPCR using specific probes for pre-TSS (-1.0 to -0.5kb of TSS), TSS (0 to +0.5kb of TSS) and post-TSS (+1.0 to 1.5 kb 
of TSS) of the indicated genes. (F) Epigenetic status of MLL target genes in 293T cells. ChIP analysis was performed on 293T cells using antibodies 
specific for H3K36me2, H3K36me3, H3K4me2 and RNAPII. The precipitated DNAs were analyzed by qPCR as in (E). 
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Figure 4. MLL-ENL recognizes the chromatin context with H3K36me2/3 and non-methylated CpGs. (A) Transforming ability of P'CE mutants 
with various mutations within the CXXC domain. The schematic structures of MLL and various P'CE mutants are shown (left). C: the minimum 
structure of the CXXC domain; C'+, the minimum structure of the CXXC domain and its C-terminal flanking region. Their ability to bind to non- 
methylated CpGs is summarized. The CFUs at the third and fourth rounds of replating are shown with error bars (SD of >3 independent 
experiments) (middle). Hoxa9 expression in the first-round colonies is expressed relative to the value of P'CE (arbitrarily set at 100%) with error 
bars (SD of triplicate PCRs) (right). (B) Protein expression of the P'CE mutants in the packaging cells. The P'CE mutant proteins were visualized by 
the anti-ENL antibody. (C) The schematic structures of various P'C mutants. A FLAG tag (black flag) is fused to the C-terminal end of the P'C 
mutants. Their abilities to bind non-methylated CpGs, nucleosomes and target promoters are summarized. (D) Subcellular distribution of the P'C 
mutants. Experiments were performed as in Figure 2E. The P'C mutants were visualized by the anti-FLAG antibody. (E) Nucleosomes associated 
with the P'C mutants. The P'C mutants and their associating nucleosomes were analyzed as in Figure 2F. (F) Distribution of non-methylated CpGs 
at the MLL target genes. Genomic DNAs of 293T cells were analyzed by CIRA. The precipitated DNAs were analyzed by qPCR as in Figure 3E. 
(G) Chromatin targeting ability of the P'C mutants. Genomic localization of the P'C mutants was analyzed by ChIP as in Figure 3E. 
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Figure 5. Non-MLL CXXC domains can functionally substitute for the MLL CXXC domain. (A) Transforming ability of P'C'E mutants 
with various non-MLL CXXC domains. The schematic structures of MLL, MLL2, DNMT1, FBXL11 and various P'CE mutants are shown 
(left). Sequence alignment of the CXXC domains of MLL, MLL2, DNMT1 and FBXL11. The mutated residues in Figure 4A are indicated by 
#. Asterisk, conserved residue. The CFUs at the third and fourth rounds of replating are shown with error bars (SD of >3 independent experiments) 
(middle). Hoxa9 expression in the first-round colonies is expressed relative to the value of P'CE (arbitrarily set at 100%) with error bars (SD of 
triplicate PCRs) (right). (B) Protein expression of the P'CE mutants carrying non-MLL CXXC domains in the packaging cells. The P'CE mutant 
proteins were visualized by the anti-ENL antibody. (C) The schematic structure of the P ; C mutant carrying the CXXC domain of FBXL11 
[P'C^FBXLll)]. A FLAG tag is fused to the C-terminal end of the CXXC domain (black flag). Its ability to associate with the tar- 
get promoters is indicated. (D) Chromatin targeting ability of P'C^FBXLl 1). Genomic localization of FC(FBXLll) was analyzed by ChIP as 
in Figure 3E. 



the notion that the presence of either H3K36me2 or 
H3K36me3 is required for targeting of the MLL fusion 
complex. Hence, our data strongly indicate that the MLL- 
fusion complex targets the promoter-proximal regions con- 
taining H3K36me2/3 and non-methylated CpGs under 
physiological conditions. 

It should be noted that the MLL fusion complex was 
also found by ChlP-qPCR at a housekeeping gene, 



glycer aldehyde phosphate dehydrogenase (GAPDH), 
whose promoter-proximal region also contained 
H3K36me2/3 and non-methylated CpGs near the TSS 
(Figure 6B and Supplementary Figure S5D). However, 
the localization of the MLL-AF6 complex was much 
weaker than that at well-known MLL target genes. 
These results suggest that the extent of occupation by 
the MLL fusion complex is influenced not only by the 
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Figure 6. Genomic landscape of MLL-AF6 target genes in ML-2 cells. (A) The average distribution of MLL-AF6, RNAPII, CpG dinucleotide, non- 
methylated CpGs and methylated CpGs at the 154 MLL-AF6-occupied TSSs. The input data and all TSS data are included for comparison. ChlP- 
seq or CIRA/MIRA-seq tags were clustered into 100-bp bins. On the x axis, the position of the associated TSS is designated as Obp. The y axis 
indicates the normalized ChlP-seq tag count (ppm). The average number of CpG dinucleotide in each bin was also plotted. (B) Genomic localization 
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Figure 7. MLL fusion-dependent gene activation is mediated by AF4-family coactivators. (A) Transforming ability of P'C+ mutants fused with 
ENL and AF5q31. The schematic structures of ENL, AF5q31 and various P'C^ mutants are shown (left). An HA tag (white flag) is fused to the 
C-terminal end of the P'C^ structure. E dAHD:the ENL portion lacking the ANC1 homology domain (AHD). E': the minimum transformation 
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bars (SD of triplicate PCRs) (right). (B) Protein expression of the P'C'+ mutants fused with ENL and AF5q31 in the packaging cells. The P'C^ 
mutant proteins were visualized by the anti-HA antibody. (C) Association of AHD and the AF4-family proteins. A series of FLAG-tagged GAL4- 
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analyzed. (F) Models of MLL-ENL-dependent gene activation. 
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amounts of H3K36me2/3 and non-methylated CpGs 
but also by other factors or environments that are yet 
undetermined. 

An interaction domain for AF4-family coactivators and 
the targeting modules are the minimum elements required 
for MLL-ENL-dependent leukemogenesis 

Next, we analyzed the structural requirements of the 
fusion partner portion. We tested the transforming 
ability of various fusion partners that were fused with 
the minimum targeting structure (P'C'+) composed of 
the PWWP domain and the CXXC domain (the 
C-terminal flanking sequence of the CXXC domain was 
included to increase the protein stability). An artificial 
protein (P'C+E') composed of the minimum targeting 
structure and the ANC1 homology domain (AHD) of 
ENL activated Hoxa9 expression and transformed 
myeloid progenitors, whereas a P'C+E mutant lacking a 
part of AHD failed to transform myeloid progenitors 
(Figure 7A and B). AHD is a binding platform for AF4 
family transcriptional coactivators (14). Indeed, the GAL4 
DNA binding domain fused with AHD (GAL4-E') 
associated with AF5q31 (also known as AFF4 and 
MCEF) (Figure 7C and Supplementary Figure S6), 
which is a member of the AF4 family, and exhibited tran- 
scriptional activation activity (Figure 7D). Removing 
AHD from P'C+E' resulted in the loss of transforming 
ability (Figure 7 A and B), whereas adding back the 
AF5q31 portion restored the transforming ability, 
indicating that association with the AF4-family 
coactivator is essential for MLL-ENL-dependent trans- 
formation. In vivo leukemogenesis assay showed that 
P'C+E' induced leukemia in vivo (Figure 7E). Therefore, 
the PWWP domain, the CXXC domain and AHD are the 
minimum essential structures required for leukemic trans- 
formation. Taken together, these results show that MLL 
fusion proteins target the promoter-proximal regions 
of transcriptionally active CpG-rich genes through dual 
recognition of epigenetic markers and activate transcrip- 
tion by recruiting transcriptional coactivators such as 
AF4-family proteins (Figure 7F). 

DISCUSSION 

This study demonstrates that MLL fusion proteins tar- 
get the promoter-proximal regions of previously active 
genes by simultaneously recognizing H3K36me2/3 and 
non-methylated CpGs and activate transcription by re- 
cruiting transcriptional coactivators. Our structure- 
function analysis demonstrated that an MLL-ENL 
mutant composed only of the PWWP domain, the 
CXXC domain and AHD was able to activate Hoxa9 ex- 
pression and transform myeloid progenitors. These results 
imply that the PWWP domain and the CXXC domain are 
the only structures essentially required for target recogni- 
tion by MLL fusion proteins. The PWWP domain specif- 
ically binds to either H3K36me2 or H3K36me3, both of 
which are indicators of active transcription. The CXXC 
domain binds to non-methylated CpGs, which are clus- 
tered in the active gene promoters. HOX loci are 



enriched with CpGs and are therefore preferentially sub- 
jected to MLL fusion-dependent gene activation. Our 
ChlP/CIRA analyses showed that the MLL fusion 
complex localizes at the promoter-proximal region, 
where non-methylated CpGs and H3K36me2/3 are 
present. Thus, dual recognition of the two epigenetic 
markers is the major determinant for MLL fusion 
proteins to identify their target genes. AHD is the 
platform to recruit AF4 family transcriptional 
coactivators, which confer transcriptional activation 
activity. The fundamental mechanism of MLL fusion- 
dependent gene activation is threefold: (i) LEDGF scans 
the genome and binds to H3K36me2/3 through the 
PWWP domain, (ii) The MLL fusion/menin complex 
binds to non-methylated CpGs in the promoter through 
the CXXC domain and a nearby chromatin-bound 
LEDGF. These two points of contact allow the MLL 
fusion complex to stably associate with its target chroma- 
tin, (iii) The MLL fusion complex activates transcription 
by recruiting transcriptional coactivators (Figure 7F). 
However, it should be noted that these three domains 
are the minimum elements, not all the elements, involved 
in MLL fusion-dependent leukemogenesis in the context 
of a full-length MLL fusion protein. The activity of the 
MLL fusion complex is likely regulated through intricate 
regulatory mechanisms. Factors that modulate its 
function may also play critical roles in MLL fusion- 
dependent leukemogenesis. 

Aberrant self-renewal is an essential feature of 
oncogene-driven proliferation. It requires the continuous 
expression of a gene set that is required for self-renewal 
through rounds of cell divisions; however, its mechanisms 
are largely unclear. Because the PWWP and CXXC 
domains are sufficient for target recognition, the MLL 
fusion complex identifies posterior HOXA genes by a 
chromatin context-dependent mechanism rather than by 
mediating sequence-specific transcription factors. The 
H3K36me3 marker is generally deposited into chromatin 
in a transcriptional elongation-coupled manner by the 
SETD2 methyltransferase (also known as KMT3A, 
HYPB and SET2), which associates with actively 
elongating RNAPII (41-43). The H3K36me2 marker is de- 
posited into chromatin by various SET domain-containing 
proteins, including NSD2, and is generally linked to tran- 
scriptional activation (16). Hence, previously transcribed 
genes containing H3K36me2/3 and non-methylated CpGs 
at the promoter-proximal regions are subject to gene 
activation by MLL fusion proteins. Consequently, the ex- 
pression of previously transcribed HSC program genes is 
maintained at high levels through rounds of cell division 
to cause aberrant self-renewal. 

In summary, our study identifies the minimum struc- 
tural requirements of MLL fusion proteins for HOX 
gene activation and reveals the chromatin context 
recognized by them. A surprisingly simple combination 
of epigenetic markers that defines previously active gene 
promoters is used in MLL fusion-dependent cellular 
memory maintenance. Thus, our results provide signifi- 
cant insights into the molecular mechanisms of cellular 
memory maintenance and aberrant self-renewal caused 
by MLL fusion proteins. 
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